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Abstract 



In this paper we explore first passage percolation (FPP) on the Erdos-Renyi random graph 
^S| I Gn{Pn), where each edge is given an independent exponential edge weight with rate 1. In the 

CN ' sparse regime, i.e., when np„ — > A > 1, we find refined asymptotics both for the minimal 

weight of the path between uniformly chosen vertices in the giant component, as well as for 
m^J • the hopcount (i.e., the number of edges) on this minimal weight path. More precisely, we prove 



a central limit theorem for the hopcount, with asymptotic mean and variance both equal to 

A/(A — l)logn. Furthermore, we prove that the minimal weight centered by logn/(A — 1) 

converges in distribution. 
C^ . We also investigate the dense regime, where np„ — ?> oo. We find that although the base 

graph is a ultra small (meaning that graph distances between uniformly chosen vertices are 

o(logn)), attaching random edge weights changes the geometry of the network completely. 

Indeed, the hopcount iJ„ satisfies the universality property that whatever be the value of Pn, 

^ i Hn/ log n — )- 1 in probability and, more precisely, (if„— /3„ log n) / y/logn, where /3„ = A„/(A„ — 

^^ ' 1), has a limiting standard normal distribution. The constant /3„ can be replaced by 1 precisely 

^D ■ when A„ ^ v^Iogn, a case that has appeared in the literature (under stronger conditions on 

A„) in [2, 12]. We also find bounds for the maximal weight and maximal hopcount between 

vertices in the graph. This paper continues the investigation of FPP initiated in [2] and [3] . 
^£^ \ Compared to the setting on the configuration model studied in [3], the proofs presented here 

are much simpler due to a direct relation between FPP on the Erdos-Renyi random graph 

and thinned continuous-time branching processes. 
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graph, flows, first passage percolation, hopcount 
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1 Introduction 

First passage percolation is one of the most fundamental problems in probability theory. The basic 
motivation to study this problem is the following. The goal is to model the flow of fluid through 
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some random medium. Suppose that we have a base graph on n vertices which represents the 
available pathways for the fluid. We attach to each edge in the graph some random edge weight, 
typically assumed to be independent and identically distributed (i.i.d.) positive random variables 
with some probability density function /. We then think of fluid percolating through the network 
at rate 1 from some source. Letting re — )• oo, one is then interested in asymptotics of various 
statistics of the flow through this medium. See e.g., [10, 13] for results and a survey on the integer 
lattice. 

Our aim in this paper is to rigorously analyze first passage percolation (FPP) on the Erdos-Renyi 
random graph (ERRG) denoted by Gn{pn)- We shall see that for two randomly chosen vertices in 
the giant component the hopcount, i.e., the number of edges on the s/iortesi- weight path between 
these vertices, scales as \ogn and we shall find a central limit theorem (CLT) for this quantity. 
We shall also find that the weight of the shortest-weight path re-centered by a constant multiple 
of log re converges in distribution to some limiting random variable. We shall describe the explicit 
distribution of the limit. We shall also find lower bounds for the maximal optimal weight and 
hopcount between vertices in the giant component. 

In [3], we have investigated FPP on the configuration model (CM) with degrees given by an 
i.i.d. sequence with distribution function F, satisfying F{x) = 0, x < 2. Consequently, all degrees 
are at least 2 and the giant component contains n — o{n) vertices, so that with high probability 
(whp), two uniformly chosen vertices are connected. Furthermore, it was assumed that for all 
X > 0, there exist constants ci,C2, such that 

cix^~^ < 1 - F{x) < C2X^~^, when r G (2, 3), 

whereas for t > 3, F should satisfy, for all x > 0, 

1 - F{x) < cx^~^, 

for some constant c. Apart from self-loops and multiple edges, the CM, with a binomial degree 
sequence, is not very different from the ERRG. The main challenge in studying the ERRG 
compared to the study of FPP on the CM in [3] is threefold: 

(a) The degree sequence in [3] was assumed to satisfy F(x) = 0, x < 2, so that all degrees are 
at least 2, with probability 1. As said this implies that uniformly chosen vertices are whp 
connected. In the present paper a uniformly chosen pair of vertices does have a positive 
probability of being connected, but this probability does not equal 1, so that we have to 
condition on the uniformly chosen vertices to be in the giant component. Technically, this 
is a big step forwards. 

(b) We deal with the case where the average degree A^ = np„ — ;■ oo, see Corollary 2.4 below, a 
scenario that is not contained in the sparse setting in [3]. This is the first time a result of 
this generality has been proved in the regime rep„ — t- oo. 

(c) The ERRG admits an elegant embedding in a marked branching process in continuous 
time. Consequently, the proofs are short and non-technical compared to those in [3]. This 
technique should prove to be useful in a number of other random graph models. Moreover, 
we include a lower bound on the weight and length of the largest shortest-weight path for 
FPP on the ERRG. 



2 Results 

In this section we formulate our main results. Throughout this paper, we work on the Erdos-Renyi 
random graph (ERRG) G„(p„), with vertex set [n] = {1, . . . ,n} and edge set En = {ii,j) '■ i,j ^ 
[n],i 7^ j}, and where every pair of vertices i ^ j \s connected independently with probability 
Pn- Furthermore, each edge e G En is equipped with an independent weight E^^, having an 

exponential distribution with rate 1. We denote by -^, — )-, and — >, convergence almost 
surely, in distribution, and in probability, respectively. The symbols o, O are the ordinary Landau 
symbols. We say that a sequence of random variables Xn satisfies Xn = Op{bn), Xn = Op{bn), 
respectively, if Xn/bn — > 0, Xn/bn is tight, respectively. We write that a sequence of events 
{£n)n>i occurs With high probability (■whp) when P(<Sn) = 1 — o(l). Binomial random variables 
are denoted by Bin(n,p), where n denotes the number of trials and p the success probability. We 
further notate by Exp(;u), Poi(A), respectively an exponentially distributed random variable with 
rate /x, and a Poisson random variable with mean A. 

In the Theorems 2.1 and 2.2 below, we shall investigate the hopcount and weight of FPP on the 
ERRG. 

Theorem 2.1 (CLT for hopcount) Let Wnin-^oo npn = A > 1 and define (5 = ^— j- > 1. Then, 
the hopcount Hn between two uniformly chosen vertices, conditioned on being connected, satisfies 
a central limit theorem with the asymptotic mean and variance both equal to /31ogn, i.e., 

Hn- 13 log n d „ 
V/^ log "- 

where Z is a standard normal random variable. 

Now we consider the asymptotics of the minimal weight. 

Theorem 2.2 (Limit distribution for minimal ^veight) Let lim„_^oo np^ = A > 1, and de- 
fine 7 = jTrj- Then, there exists a non- degenerate real valued random variable X with distribution 
p, such that the minimal weight Wn between two uniformly chosen vertices, conditioned on being 
connected, satisfies 

W„ - 7 log n — > X. 

Remark 2.3 (Joint convergence) The proof shall reveal that the convergence in Theorems 2.1 
and 2.2 holds jointly, with the limits being independent. 

Let us identify the limiting distribution p. Consider a continuous-time Galton- Watson branching- 
process with Poi(A) offspring, where the individuals have an exponential life time with rate 1. 
Denote the number of alive individuals at time t by N{t). It is well known [1] that there exists 
a > such that e~"^N(t) has an almost sure limit W. For the case under consideration it is 
readily verified that the Malthusian parameter a = A — 1 > 0. Let D denote a Poi(A) random 
variable representing the number of offspring from one individual, and, conditioned on D, denote 
by Ei,E2, ■ ■ ■ ,Ed, the exponential lifetimes of the offspring. Then the limit W satisfies the 
stochastic equation 

D 



W = Y,e~"^'Wi, (2.1) 



i=l 



where on the right-hand side ah the involved random variables are independent with Wi being 
copies of W. Hence taking conditional expectations w.r.t. D, it is seen that (/)(t) = E[e~*^] 
satisfies the functional relation 

(Pit) = E[(E[e-*'=""'''^])^] = exp (-X H [l - 0(fe-(^-^)^)] e-'^dx) . (2.2) 

From this, it is quite easy to prove that the random variable W has an atom at zero of size px, 
where p\ is the smallest non-negative solution of the equation 

Px = exp(-A(l -px)), 

i.e., Px is the extinction probability of a branching process with Poi(A) offspring. Furthermore, 
the random variable W, conditioned to be positive, admits a continuous density on M"*", and we 
denote by 

Wx = {W\W > 0). (2.3) 

To construct p we shall need the following random variables: 

(a) Let W^^\W^^^ be independent and identically distributed as Wx, 

(b) Let E = Exp(l) be independent of Wf . 



In terms of these random variables, the random variable X with distribution p satisfies 

X = -7 log (7iy{^') - 7log (7t^f ) + jlog{E). 



We next study the dense graph setting, where npn = A„ — t- oo. The proof in this setting follows 
the same lines as that of Theorems 2.1-2.2, and therefore we will only give a sketch of proof. 
Observe that for np„, — t- oo, any pair of vertices is whp connected. It is not hard to see that, 
in this case, the giant component consists of n(l — o(l)) vertices, and that the graph distance 
between two uniformly chosen vertices is log7i/logA„ = o(logn), so that the random graph is 
ultra small. In the statement of the result, we denote by Hn and Wn, respectively, the hopcount 
and minimal weight of the shortest-weight path between two uniformly chosen vertices. 

Corollary 2.4 (Limit of hopcount and w^eight for npn — t- oo) Set A„ = npn. For A^ — ?■ oo, 

as n ^ oo, 

(a) 

i?n-/3„logn d 

Vlogn 

where /3„ = An/(A„ — 1) and Z is a standard normal random variable; 

(h) 

{K-l)Wn-\ogn^X, (2.5) 

where the random variable X = M\ -\- M2 — M^, with Mi, M2,M^ independent Gumbel random 
variables, i.e., P(-Mj < x) = A(x) = exp(— e~^), 1 < i < 3, for all x G M. 

Remark 2.5 (Dense setting) (a) In Part (a) of Corollary 2.^ the centering /3„logn can be 
replaced by logn if and only if \n/^/\ogn — t- 00. In this case, the hopcount has the same limiting 
distribution as on the complete graph (see e.g., [12]). 

(b) Note that the distribution of Hn for the case where npn/i}ogn)'^ — )• 00 was obtained in [12]. 
In [2, Theorem 5] limit results for Hn/ log n and npnWn — log n were obtained in the case where 
\iinmfn'^cxiinpn)/{logn) = a, with 1 < a < 00. Thus, the present paper essentially completes the 
study of FPP on ERRGs in all regimes. 



Theorems 2.1-2.2 state that for uniformly chosen connected vertices, the weight and length of the 
optimal path between them scales as 7logn and /31ogn, respectively. The following shows the 
existence of pairs of vertices with a much larger optimal path weight and hopcount: 

Theorem 2.6 (Existence of long paths) Let np^ — )■ A > 1 and define by c{X),d{X) the con- 
stants 

c(A) = -^ + ^^, d{X) = ^ + ^^, (2.6) 

A-1 logl^Al A-1 logl^fAl 

where ^\ is the dual of X, i.e., the unique fi G (0, 1) such that 

^e-f" = Xe-^. (2.7) 

Then for any given e, with high probability, there exists a pair of vertices say i*,j* in the giant 
component such that the weight of the optimal path Wn{i*,j*) and its number of edges Hn{i*,j*) 
satisfy the inequalities 

M^n(i*,j*)>(l-e)c(A)logn, Hn{i*,j*)> {I -e)d{X) log n. (2.8) 

We conjecture that the above result is optimal, i.e., you can get between any pair of vertices 
within weight c(A) log n and with at most d{X) log n hops. 

In [3], we have proven results parallel to those in Theorem 2.1 and 2.2 for the configuration 
model (CM) with degrees given by an i.i.d. sequence with distribution function F, where F{x) < 
cx^~'^ , r > 3, for some constant c and all x > 0. We found that the asymptotic hopcount between 
two uniformly chosen vertices converges to a normal distribution with mean and variance equal 
to ^j^ logn, where v = E[D(L' — 1)]/E[Z)] and where the random variable D is distributed as the 
degree distribution F. Note that in Theorem 2.1 the role of v is taken over by the parameter A. 
This is not surprising, since for a Poi(A) variable D, we have u = K[D{D — 1)]/E[D] = A. 



3 Setting the stage for the proofs 

A rough idea of the proof is as follows. Fix two vertices, say 1 and 2, in the giant component. 
Think of fluid emanating from these two sources simultaneously at rate 1, so that at time t, J-"'''(t) 
is the flow cluster from vertex i and includes the minimal weight paths to all vertices wetted at 
or before t from vertex i, i = 1, 2. When these two flows collide (namely, when the flow from one 
of the sources reaches the other flow) via the formation of an edge (i'i,'L'2) between two vertices 
vi £ J^'^'(-) and V2 G J^*^'(-), then the shortest-weight path between the two vertices has been 
found. This collision time, which we denote by 5i2, tells us that the weight between the two 
vertices, W„(l,2), equals 

Wn{l,2)=2Si2. (3.1) 

Furthermore, if Gn{vi),i = 1, 2, denotes the number of edges between the source i and the vertex 
Vi along the tree J^'''(5'i2), then the hopcount if„(l,2) is given by 

if„(l, 2) = Gn{vi) + Gn{v2) + 1. (3.2) 

The above idea is indeed a very rough sketch of our proof. In the paper we embed the flow 
on the ERRG in a continuous-time marked branching process (CTMBP), where the offspring 
distribution is binomial with parameters n — 1 and p (see Section 4.2). With high probability, the 
marks in the CTMBP correspond to the vertices in the ERRG. We denote by {SWT^^}m>o the 



marks of the individuals wetted by the flow after m sphts and the arrival times of these splits, 
where the superscript i, i = 1,2, denotes the root i of the flow. It is not mandatory to let the 
flows grow simultaneously, and for technical reasons, the proof is simpler if we flrst grow SWTj^-* 
to a size a„ = [-v/raj- After this, we grow SWT[^', and we stop as soon as a mark of SWT'j'^ 
appears in {SWT*^''}J^^q. The size a^ = \\^ is the correct one, since if both flows are of size 
approximately ^/n, the probability that the second flow flnds a mark of the flrst flow in some 
time interval of positive length is of order 1. 

The CLT for the distance Gm between the TTi* -wetted mark and the root as well as the limit 
distribution for the weight of the path between this mark and the root are given in Section 4.3. 
This theory is based on the asymptotics for Bellman-Harris processes which will be developed in 
Section 3.1. 

In Section 4.4 we investigate the connection time C^, i.e., the random time until the second flow 
starting from mark 2 hits the flow of size an, which started from mark 1. We prove that C„/a„ 
converges in distribution to an Exp(l) random variable. This can informally be understood as 
follows. The number of distinct marks in SWT^^' is a,i(l -|- o(l)). Each of these marks is chosen 
with probability 1/n in SWT[^\ so that the first time that any of these marks is chosen is close 
to a geometric random variable with success parameter an/n. This random variable is close to 

n/a„Exp(l), so that indeed Cn/dn — > Exp(l). 

We furthermore show that, conditioned on C„, the hopcount Hn is, whp, the independent sum 
of Gu „ and G^^ . Here GY!' „ denotes the distance between root 1 and a mark in SWT'^' that 
is chosen uniformly at random. 

With the above sketch of proof in mind, the remainder of the paper is organized as follows: 

(1) In Section 3.1 we will analyze various properties of a Bellman-Harris process conditioned on 
non-extinction, including times to grow to a particular size and the generation of individuals 
at this time. In this continuous-time branching process the offspring will have a Poisson 
distribution. 

(2) In Section 4.1, we introduce marked branching process trees with binomially distributed 
offspring and make the connection between these trees and the ERRG, by thinning the 
marked branching process tree. 

(3) In Section 4.2, we replace the general weights on the edges by exponential ones so that 
we end up with a continuous-time marked branching processes (CTMBP) with binomial 
offspring. We further focus on some of the nice properties of the involved random variables, 
that are a consequence of the memoryless property of the exponential distribution. 

(4) By coupling the CTMBP with binomial offspring to a Bellman-Harris process with Poisson 
offspring, we deduce in Section 4.3 the limit theorem for the generation Gm and the weight 
Am from the results in Section 3.1. 

(5) In Section 4.4, we present a refined analysis of the connection time of the two flow clusters. 

(6) Finally, in Section 5, we complete the proofs of our main results. 

The idea of the argument is quite simple but making these ideas rigorous takes some technical 
work because of the issue of conditioning on being in the giant component. 



3.1 Asymptotics for Bellman-Harris processes 

Here we shall construct random trees where each vertex lives for an exponential amount of time, 
dies and gives birth to some number of children. Fix a sequence of non- negative integers di,d2, ■ ■ ■■ 
For future reference, let Si = X]i=i ^-j ~ (^ ~ !)■ We shall make the following blanket assumption 

Si > 0, for all i > 1. (3.3) 

The condition in (3.3) is equivalent to the fact that the tree where the i vertex has degree di is 
infinitely large. Consider the following continuous-time construction of a random tree: 

Construction 3.1 (FPP on a tree) (1) Start with the root which dies immediately giving 
rise to di alive offspring; 

(2) each alive offspring lives for an Exp(l) amount of time, independent of all other randomness 
involved; 

(3) when the i^^ vertex dies it leaves behind di alive offspring. 

In terms of the above construction, we can identify Si as the number of alive vertices after the 
i*^ death. Let Ti, T2, . . . , Tm be the time spacings between the {i — ly^ and i^^ death 1 < i < m. 
We shall often refer to Ai = Ti + T2 + ■■■+ Ti^ 1 < i < m, as the time of the i^^ split. Note that 
at the graph topology level, the above procedure is the same as the following construction, where 
exponential life times do not appear: 

Construction 3.2 (Discrete-time reformulation of FPP on a tree) The shortest-weight - 
graph on a tree with degrees {dj}^]^ is obtained as follows: 

(1) At time 0, start with one alive vertex (the initial ancestor); 

(2) at each time step i, pick one of the alive vertices at random, this vertex dies giving birth to 
di children. 

Let Gm denote the graph distance between the root and a uniformly chosen vertex among all 
alive vertices at step m. We quote the following fundamental result from [5]. ^ 

Proposition 3.3 (Shortest-weight paths on a tree) Pick an alive vertex at time m > 1 uni- 
formly at random from all vertices alive at this time. Then 
(a) the generation of the m*^ chosen vertex is equal in distribution to 



Grn = Y,Iu (3.4) 



i=l 



where {li}'^^ are independent Bernoulli random, variables with 

P(/i = 1) = d^/s^, l<i<m. (3.5) 



^A new probabilistic proof is given in [3] , since there is some confusion in comparing tlie definition Si in this 
paper and the definition of Si given in [5, below (3.1)]. 



(b) the weight of the shortest-weight path between the root of the tree and the m chosen vertex 
is equal in distribution to 



Ti + T2 + ... + Tm = Y^ Ei/si, (3.6) 



d 

i=l 



where {Ti}'^^ are i.i.d. exponential random variables with rate Si and hence Ti is equal in distri- 
bution to Ei/si. 



The above proposition states that the random variable Gm is the sum of m independent Bernoulh 
random variables. It is known that in a wide variety of settings, such quantities essentially follow 
a CLT. We shall be interested in showing that the standardization Gm of Gm converges in 
distribution to a normal random variable Z, and the way we shall prove this is by showing that 
the Wasserstein distance between the distribution Gm and that of Z goes to 0. Denote by Wass 
the metric on the space of probability measures defined as 



Wass(/xi,/i2) = sup 



gd^ii - / gd^ 



: g is 1— Lipschitz and bounded 



It is well known [7, Theorem 11.3.3] that this metric on the space of probability measures induces 
the same topology as the topology of weak convergence. Thus showing that Gm, — > Z is equiv- 
alent to showing that Wass (/i^, A*) — ^ where Gm ~ /"m and Z ~ /i; here X ~ z/ means that the 
law of the random variable X is equal to u. 

The following result is simple to prove using any of the methods for showing CLTs for sums of 
independent random variables, e.g. Stein's method [6]. 

Lemma 3.4 (CLT of hopcount in tree) As before let Gm denote the generation to a uni- 
formly chosen vertex alive at time m. Define the sequence of random variables 

/=V _ '-'m — l^i=l Pi fn 7\ 

'^m — /==^m r^ T' ^ ' 

Vl^i=l Pii^ - Pi) 

where pi = di/si. Let /im denote the distribution of Gm O'^d fi the distribution of a standard 
normal random variable. Then 

Wass{fim,p)< ,^m ^ r, f (3-^) 

V22i=i Pii^ - pi) 

Proof. Combining [6, Theorem 3.1 and 3.2] yields 

m 

Wass{fim,p)<3Y,n\Vi\^]: (3-9) 

i=l 

where 

li - Pi 



and where /j is a Bin(l, pi) random variable. From this we obtain 

m -. m 



i=l 



Er=iP.(i-pop/^tt vnuw^^y 



using pf + {1- piY < 1. ■ 

From now on we take Di, D2, ■ ■ ■ i.i.d. where Dj has a Poi(A) distribution with A > 1. Further- 
more, we consider the continuous time construction of a random tree as in Construction 3.1, with 
di = Di and Si = Yl]=i ^j ~ (^ ~ !)• Since the Dj are now random, for any fixed time m > 1, 
there is now some non-zero probabihty that Sj = for some 1 < j < m. However, note that for 
any m, 

P(Si>0 Vi = 1,2, ...,m) > 1-pA, 

where px is the extinction probabihty of a Gahon- Watson branching process with Poi(A) offspring. 
Let Gm be the generation of a randomly chosen ahve individual among all alive vertices at time 
m, conditional on Si > for all 1 < i < m,. Let Am be the time for the m split to happen in the 
continuous-time construction (see Construction 3.1). Then we have the following asymptotics: 

Theorem 3.5 (CLT for hopcount on trees conditioned to survive) Conditioned on Si > 
for all 1 < i < m, the following asymptotics for m — )• 00 hold: 
(a) the generation Gm satisfies a CLT, i.e., 



Gm - /3 log m d 



V?T^ 



m 



Z, (3.10) 



where Z is standard normal and 13 = ^^:ri/ 

(h) the random variable Am. satisfies the asymptotics: 

^rn. -7logm ^^ -7log7WA, (3.11) 

where W\ has a distribution given by (2.3). The limits in (a) and (b) also hold jointly, where the 
limits are independent. 

The issue of conditioning is a technical annoyance as it removes the independence of increments 
of the random walk {Si^^^. However what should be intuitively clear is that, for a random walk 
with positive drift, conditioned to be positive is, whp, the same as conditioning the path to be 
positive in the first Wm steps, where we assume Wm = o(loglogm). Indeed, by the time Wm, the 
walk has reached height approximately {\ — \)wm and if we just add independent Poisson random 
variables from this stage onwards, then the random walk will, whp, remain positive, since the 
walk has reached a high level by this time, and the probability that it would reach by time m 
is exponentially small in Wm- This idea is made precise in the following construction. First we 
shall need some notation. Fix Wm = o(loglogm) — )• 00 and lei Yi = Di — 1 where the Di are 
i.i.d. Poi(A) random variables. Let Sj = \ + Yll=i ^i- 

Now consider the following construction 

Construction 3.6 Take 5"*, 5J", . . . , S*^^ equal in distribution to Si,S2, ■ ■ ■ , Sw^ conditioned on 
being positive. Furthermore, let l^^+i, 1^^4.2, ■ ■ ■ Ym be i.i.d. distributed as Poi(A) — 1, indepen- 
dent of S^jS^, . . . , S^^ . Define a new sequence Si, ... , Sm by 

S-i ^^' '-"^"^ (3 121 

Then the following proposition yields good bounds on the behavior of this random walk: 



Proposition 3.7 (Good bounds for the conditioned random ^valk) The sequence in Con- 
struction 3.6 satisfies the following regularity properties: 

(a) P(5i > 0, ...jS'm > 0) = 1 — o(l), as m -^ oo, and there exists a coupling such that 
P(5j = Sj, 1 < j < m) = 1 — o(l), as m ^ oo, where Sj, 1 < j < m, is equal in distribution to 
the random walk Rj = 1 + X^i=i(^j ~ ^)A ^ J ^ ^'^j conditioned on Ri > 0, . . . , Rm > 0. 

(b) There exists a constant C > such that, whp, for all j > log log m, 



(A - l)j - C^/7bi7 < Sj < (A - l)j + CVJT^. (3.13) 

(c) Let Pi = Di/Si. Then 

m 

^Pi(l - Pi) = /31ogm + 0p(loglogm). (3.14) 



i=l 



Assuming Proposition 3.7, let us show how to prove Theorem 3.5: 

Proof of Theorem 3.5. In order to prove Part (a), we note that according to Lemma 3.4 it 
suffices to show that J^i^i Pii^ ~ Pi) — >■ oo, as m — )• oo. This is immediate from Part (a) and (c) 
of Proposition 3.7. 

The almost sure convergence of Am = Ti + . . . + Tm in (3.11) is a little more difficult. We refer 
to [1, Theorem 2, p. 120] where it has been proved that the time of the m split, Am, when the 
branching process has Poi(A) offspring satisfies the asymptotics 

Af(^^)e-(^-i)^'" ^ W, (3.15) 

where N{t) denotes the number of alive individuals at time t. Observe that (3.15) can easily 
be deduced from the convergence e~^'^^^^^N{t) -^ W, by substituting Am for t, and using that 
Am. -—> oo. Conditioning on the random walk to stay positive yields: 

iV(A„)e-(^-i)^-|5i >0,...,Sm>0)^Wx. (3.16) 



Now N{t) is the number of alive individuals at time t, so after m splits N^Am)/^!. = Sm/iTT- -—> 
A — 1, by the strong law. Hence, with 7 = 1/(A — 1), 

{Am.\Si > 0, . . . ,5^ > 0) - 7logm ^^ -7log7VFA. (3.17) 

■ 

Proof of Proposition 3.7. We start with the proofs of Part (b) and Part (c). Note that 
Wm = o(loglog?Ti), and that the random variables Dj with j > Wm are independent. The proof 
of (b) is then straightforward from the large deviation properties of the Poisson distribution and 
a simple union bound. For (c) we write 

1=1 i=\ j<loglogm j=[loglogm] 

For the second sum on the right-side we combine the bounds in (b) and use Chebyshev's inequality 
on the i.i.d. Poisson random variables Di, i > log log m, to obtain 

E 7Y^ = /31ogm + 0,(l). 



.(A-l)i 

i= [log log m] 



10 



The first sum on the right-side is obviously of order Op (log log m). This proves (c). 

For the first statement in (a), we observe that {Si > Vi = 1, . . . ,Wm} and {Sm > x} are both 
increasing in the i.i.d. random variables {Di}^^. Therefore, by the FKG-inequality [9] and for 
any j; > 0, we obtain that ¥{Sm > x | 5j > Vi = 1, . . . , Wm) > ^{Sm > x). Therefore, 

P(5m > Im. I 5i > OVi = 1, . . .,Wm) > P(5^ > Im) = I - o{l), (3.18) 

when we take /„ = (A — l)'Wm — Cy/wm log m, and use Part (b). 

We now turn to the coupling statement in Part (a). Denote by P/ the probability distribution 
of the random walk starting at I. Then, using again the notation {Rj}j>o for an unconstrained 
random walk with step size Y^, 

P,„( min 5, < 0) < FiJmmR, < 0) = P(maxi?, < /„) < e"^*'- = o(l), (3.19) 

0<j<m j>0 j>U 

where Rj = —Rj, j > 0, and where 9* > is the unique positive solution of the equation 

0-X + Ae^^ = 0. (3.20) 

Indeed, the final inequality in (3.19) is obtained using the martingale Z„ = nj<n^~ ' which is 
by (3.20) the product of independent unit mean random variables [15, p. 342]. 

Finally, if we take Sj{u}) = SUuj) for j < Wm and all u in the probability space, whereas we 

take for Sj{uj) = S^^{uj) + Y.i=w„,+i ^i(^)' ^°^ J > ^"i, and those to with /„ + Ei=u,„+i ^i('^) > 
0, Wm < j < m and Sj{uj) = otherwise, we obtain our desired coupling. This completes the 
proof of Part (a) and, since the proof of Part (b) and Part (c) has been given above, we are done. 



4 Branching process trees and imbedding of ERRG 

We introduce marked branching process trees in Section 4.1. In Section 4.2, we attach the 
exponentially distributed weights to the edges and obtain in this way a continuous-time marked 
branching process (CTMBP) with binomially distributed offspring. Section 4.3 describes the 
distribution of the height and weight of minimal weight paths in these CTMBP. In Section 4.4, 
we treat the connection between the CTMBP and the ERRG. 



4.1 Marked branching process trees 

In this section we describe a marked branching process tree (MBPT) with a binomially distributed 
offspring, and we show how this MBPT should be thinned to obtain the connection with the 
ERRG. We then attach i.i.d. weights having probability density / on (0, oo) to both the branching 
process tree and the ERRG. We interpret these weights as distances between marks, vertices, 
respectively. We close the section with a proof that for each time t > 0, the set of marks that 
can be reached in the thinned process within time t from the root is identical in distribution to 
the set of vertices that can be reached within time t from the initial vertex iq in the ERRG. 

The set of marks is denoted by [n] = {1, 2, . . . , n}. In the branching process tree, we start with 
a single individual with mark io £ [n], and we put Iq = {io}. This individual reproduces a 
binomially distributed number of offspring Xi, with parameters n — 1 and p. We attach each 
of these Xi children to their father by a single edge, so that we obtain a tree structure. Each 
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edge is assigned a weight taken from an i.i.d. sample having probabihty density /, with support 
(0,00). To obtain the marks of the offspring we take at random a sample of size Xi from the 
set [n] \ {io}- The corresponding set of marks is denoted by /i. Hence the marks are chosen 
uniformly at random from a set of size n — 1 and are all different. Moreover the mark of the 
father (ig) is not present among the set of marks Ii of the offspring. Observe that the marks can 
be seen as vertex numbers of the ERRG; from this viewpoint, the children of individual io, given 
by the vertex-set /i, are the direct neighbors of the vertex iq in the ERRG. 

We proceed by taking the individual for which the corresponding edge-weight is minimal. In 
our language this individual is reached after Ti time units, where Ti is equal to the length of 
the minimal edge weight, since the fluid percolates at rate 1. The mark reached by time Ti is 
denoted by ii. The reproduction process for ii is identical to that of zq: we take a Bin(?i — l,p) 
number of children (denoted by X2) and connect each of them by a single edge to ii. The X2 
'new' edges are supplied by weights taken i.i.d. with probability density /. We update the 'old' 
edges by subtracting Ti from their respective weights, because Ti time units have been used. We 
then complete this step by assigning marks to the new individuals, taken uniformly at random 
from the set [n] \ {ii}, this mark set is denoted by I2. 

We proceed by induction: suppose that we have reached the individuals io,ii, . . . ,ik, in this order, 
so that the (updated) weight attached to the edge with endpoint ik was the minimal edge weight 

at time Ak = Ti + T2 + . . . + T^. We now form Xk+i = Bin(n — l,p) new edges and attach 
them to ik- These edges are supplied by weights, taken i.i.d. with probability density /, whereas 
at the same time, the weights to all other edges emanating from io,ii, . . . ,ik-i are updated by 
subtracting T^. We assign marks analogously as before by drawing them uniformly at random 
from the set [n] \ {ik}- Obviously, this mark set is denoted by I^- 

We thin the MBPT defined above as follows. At each time point A^ = Ti + T2 + . . . + T^ we 
delete the newly found individual with mark i^ and the entire tree emanating from i^, when 

ik ^ {io,k,---,ik-i}, (4.1) 

i.e., we delete ik and all its offspring when the mark i^ appeared previously. Observe that the 
probability of the event (4.1) is at most k/n. The sets {/fc}fc>o are obtained from {/A:}fc>o, by 
deleting the thinned vertices. 

Obviously, for each fixed n, the thinned MBPT will become empty after a finite time, even 
in the super critical case ((n — l)pn > 1), when the unthinned MBPT will survive with positive 
probability. This is because we sample the marks from the finite set [n], and hence with probability 
one at a certain random time all marks will have appeared. We consider the MBPT and its thinned 
version up to the first generation that the thinned process becomes empty. Furthermore, consider 
the ERRG with n vertices and attach weights to the edges independently (also independent from 
the branching process) with the same marginal density / as used in the branching process. Then, 
by the independence of binomial random variables Xi, X2, . . . and the thinning described above, 
for each t > 0, the set of marks that are reached by time t in the thinned MBPT and the set of 
vertices that are reached by time t in the ERRG, are equal in distribution. Since this holds for 
each t > the proof of the following lemma is obvious: 

Lemma 4.1 (FPP on ERRG is thinned CTMBP) Fix n > 1 and p G (0, 1). Consider the 
thinned MBPT until extinction with i.i.d. weights having density f with support (0,oo). If we 
apply independently an identical weight construction to the ERRG, then for any io G [n], the 
weight Wn{io,j) of the shortest path between initial vertex io and any other vertex j £ [n] in the 
ERRG is equal in distribution to the weight of the shortest path between io and j in the thinned 
MBPT. 
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Remark 4.2 (a) In both random environments (thinned MBPT and ERRG) it can happen that 
iQ and j are not connected. We then put the weight equal to +00. 

(b) Observe that since the weight distribution admits a density and is defined on finite objects, the 
minimal weight, if finite, uniquely identifies the minimal path and hence the number of edges on 
this minimal path, the hopcount. Hence, Lemma 4-1 o-lso proves that the hopcount between zq and 
j in the ERRG is equal in distribution to the hopcount between i^ and j in the thinned MBPT. 

4.2 The continuous-time branching process 

We study distance between uniformly chosen, connected, vertices in Gn{pn) with exponentiahy 
distributed edge-weights. Since the vertices of the ERRG are exchangeable we can as well study 
the distance between vertex 1 and vertex 2 conditioned on the event that 1 and 2 are in the giant 
component. For the remainder of the paper we put: 

fix) = e-^ X > 0, (4.2) 

the density of the exponential distribution with rate 1. For the initial vertex we take either io = 1 
or zq = 2, and, if necessary, we will use a superscript to indicate from which vertex we start. Since 
we have a {distributional) embedding of the vertices of G„(p), reached by time t, in the marked 
branching process tree (see Lemma 4.1 and Remark 4.2) we can restrict our attention exclusively 
to the MBPT and its thinned version. 

We start from the individual with mark 1 and let fluid percolate at unit speed from this individual 
until it reaches one of the children. Because the weights are exponentially distributed (see (4.2)), 
the number of individuals that has been reached by time i > form a continuous time branching 
process (CTMBP) [1, Chapter 2]. The alive individuals are defined as those individuals which 
are directly connected to the reached individuals appropriately called wetted individuals. At 
time t = this number is Si = Xi. Then at the first splitting time Ti at which time the 
second individual is wetted the number of alive vertices equals 52 = Xi + X2 — 1 , since X2 new 
individuals are born and one individual gave birth and died (became wet). So, starting from the 
i.i.d. sequence Xi, X2, . . . , we define the random walk 

Sk = Xi + ...+Xk-{k-l), k>l (4.3) 

and the inter-splitting times T^, k > 1, where conditioned on Xi,X2, . . . ,Xfc, the random vari- 
ables Ti,T2, . . . ,Ti^ are independent and where Tj has an exponential distribution with param- 
eter Sj. Indeed, by the memoryless property of the exponential distribution, when t = Aj = 
Ti + . . . + Tj, we have Sj+i alive individuals and all their S'j+i edges, with which they are at- 
tached to the dead or wet individuals have an independent exponentially distributed random 
variable with rate 1 as weight. Therefore the inter-splitting time Tj^i until the next individ- 
ual is wetted, conditioned on -Sj+i, is equal in distribution to mini<s<5^.^-^ Es, where Ei, E2, ■ ■ ■ 
are independent rate 1 exponentially distributed random variables. This shows that the condi- 
tional distribution of Tj+i is the correct one, since the minimum of q independent Exp(l) has an 
exponential distribution with rate q. 

At each splitting time A^, k > 1, the number of wetted individuals equals k + 1, whereas the 
number of edges and endpoints competing to become wet equals Sk+i- We now introduce shortest- 
weight trees. Since the ERRG is imbedded in the CTMBP, we introduce the SWT in terms of 
marks and splitting times. The shortest weight tree SWT^ is equal to the collection of marks 
that are wetted at the k^^ splitting time A^ and we include in the definition the splitting-times 
Ai,A2, . . . ,Ak. So, SWTq = {{io},AQ = 0), the mark of the root, and 



SWTk = ({io,ii,...,ik},{Ao,Au...,Ak}), k>l, (4.4) 
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where io,ii, . . . ,ik, denote the marks reached at the sphtting times Ao,Ai, . . . ,Ak, respectively. 
To connect ij with the sphtting time Aj, we introduce the mapping t{ij) = j, so that mark 
i G SWTfc was reached at time ^i(j) ■ For a fixed mark i € SWT^ and ah marks j ^ i, the random 
variables IjjjSWTfc that indicate whether edge ij is in the branching process tree (or in other 
words: j is one of the alive individuals born out of i) are independent and satisfy 

hj = l\SWTk)=Pn- (4.5) 

Furthermore, let us fix A; > and consider the in total Sk+i alive individuals at the splitting time 
Ak- The edges between the wet mark i and the mark j, corresponding to an alive individual with 
mark j, has weight Eij, i G SWT^. Now observe from the memoryless property of the exponential 
distribution, that, conditionally on SWT^, 

Ei,\SWTk = Ak+i - ^i(,) + Ei^, (4.6) 

where E'^, are independent Exp(l) random variables. 

4.3 Hopcount and weight from root to newly added individuals 

Concerning the height and the weight of the path with minimal weight from root 1 in {SWTfc}fc>o 
to the 771*^ individual that has been wetted we refer to (3.4) and (3.6), respectively, where the 
random variables Ii,l2, ■ ■ ■ ,Im are conditionally independent given Xi,X2, . . ■ , X^, the binomi- 
ally distributed offspring in the tree with root 1, and where dj = Xj, Sj = Sj. Indeed, the height 
of the TTi^^ individual that has been wetted in the CTMBP is equal to the generation Gm of that 
individual and its weight is equal to the splitting-time A^ = Ti + . . . + T^, where conditionally 
on Xi , X2 , . . . , Xm , the distribution of Tj is given by 

T,^E,/Sj, (4.7) 

and where {Ei}^^ are i.i.d. exponential random variables with mean 1, independent of all random 
variables introduced earlier. The CLT for the hopcount Gm and the limit law for the weight A^, 
in the CTMBP can be deduced from Theorem 3.5 in the following way: 

Proposition 4.3 (Asymptotics for shortest weight paths in the CTMBP) Consider 
an i.i.d. sequence Xi,X2, . . . , with a binomial distribution where the parameters n — 1 and pn 
satisfy lim„_^.oo npn = A > 1, and put f3 = A/(A — 1). Further more let Gm and Am. be defined as 
in (3.4) and (3.6), respectively, with di = Xi,i > 1. Then, conditionally on Si = Xi + . . . + Xi — 
(i — 1) > for all 1 < i < m, the asymptotics in Theorem 3.5 (a-b) remain to hold, where the 
limits are independent. 

Proof. According to [11, Theorem 2.9], we can couple X ~ Bin(n— l,p„) and D ~ Poi((n — l)p„), 
by means of a pair of variables (A, D) with A ~ Bin(n — l,Pn) and D ~ Poi((n — l)pn), so that 
for any e > 0, and n sufficiently large, 

P(A ^D)<in- 1)pI < ^^^. (4.8) 

Hence for m = o{n), we can couple the i.i.d. sequence Ai, A2, . . . ,Xm to an i.i.d. sequence of 
Poisson variables Di,D2, ■ ■ ■ , Dm, such that 

P(Ai / Di, for some i < m) < V ^ ^ = m ^ ' -^ 0. (4.9) 

^-^ n n 
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Next, we define {Ji,Ki),l < i < m, conditioned on {Xi,Di), . . . ,{Xm,Dm) as follows. Let 
Ui,U2, ■ ■ ■ , Um be an i.i.d. sequence of uniform (0, 1) random variables. For each i, the sample 
space of (Jj, Ki) is {0, 1}^, and the conditional probabilities are defined by 

F{J, = l\U,) = l . , F{K, = 1\U,) = 1 f,, (4.10) 

i i 

where Sf = {Xi + . . . + Xi) - {i - 1), S'f = {Di + . . . + Di) - {i - 1), and where 1^ de- 
notes the indicator of the set A. Note that the joint distribution of {Ji,Ki), conditioned on 
{Xi, Di), . . . , {Xm, Dm), is completely specified by the probabilities in (4.10). Finally, set 

Am = {{Xl=Di),...,{Xm = Dm)} (4.11) 

Then by (4.9), we have that F{Am.„) -^ 1, and 



E 



E^^ 



Ki)>F{AmJ^l, (4.12) 



because on Am„ we have Ji = Ki, 1 < i < rrin- In order to prove Part (a), we have to show 
that YlT=i "^i conditioned on Sf > 0, 1 < i < m„ has, whp, the same distribution as Yll^i ^i 
conditioned on S[' > 0, 1 < i < m„,. Observe that the statement follows from (4.11) and (4.12) 
since by the coupling introduced above, 

P(5f > 0, 1 < i < m„) ^ 1 - PA > 0. (4.13) 

Hence, Part (a) follows from Part (a) of Theorem 3.5. Referring to Part (b) of Theorem 3.5 and the 
definition of Am in (3.6), we conclude that the proof of Part (b) also follows in a straightforward 
manner. ■ 



4.4 The connection time 

In Definition (4.4) of Section 4.2 we have introduced {SWTfc}fc>o, which includes the set of marks 
of the individuals that are wet after the k split. More precisely, we grow a CTMBP from a 
root with mark 1 and include the marks that are successively reached and their splitting-times 
in {SWT^^'}fc>o. Our plan is to grow SWT^^' until the set of marks has reached size a„ = [-v/nj; 
the branching process then contains a„ wet individuals and the same marks can appear more 
than once. Then, we grow independently, starting from a root with mark 2, a second marked 
branching process. We denote the shortest weight tree, i.e., the wetted marks and their splitting 
times of this second process by {SWTJi^^}fc>o. We stop the first time that this second process 
contains a mark from SWT^^\ i.e., we stop growing {SWT^^ }fc>o at the random time C„ defined 

by 

C„ = min{m > : SWT[^^ n SWT*,^^ / 0}, (4.14) 

where SWT^!' n SWT'^^ denotes the common marks in SWT<!) and SWT'^^ We now reach 
the main theorem of this section, where we establish the connection between the weight Wn 
and the hopcount Hn in the ERRG and the weight and height in the shortest-weight trees 
{SWTJj*^}fc>o, i = 1,2, introduced in this section. 

Theorem 4.4 (Connecting hopcount to heights in CTMBP) Let Hn, Wn denote the hop- 
count and the weight, respectively, of the minimal path between the vertices 1 and 2 in the Erdos- 
Renyi graph Gn{pn), where we condition on the vertices 1 and 2 to he in the giant component, 
(a) For n — >• 00, whp, 

Hn^G^a^+G^l. (4.15) 
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where Gu^^a„ is the height (or generation) of a uniformly chosen mark in SWT^j^^, and Gq is 
the height of the mark attached at the random time C^ defined in (4.14), and where conditioned 
on C„ the variables Gjj ^ and Gq are independent, 
(b) Similarly, for n — >• oo, whp, 



d 



Wn^Ai^l+A^l, (4.16) 



where Am, i = 1,2, is the splitting-time of the m^^ mark in {SWTl''}fc>o. 






For the moment, we postpone the proof of Theorem 4.4. We first need to estabhsh two interme- 

-A; ' 



diate results. Observe that if in SWTi,'', i = 1,2, all marks on the minimal weight path between 



the root i and some other mark U S SWTJ, appeared for the first time, then the entire path is 
contained in the thinned branching process, and by Theorem 4.1 and the remark following that 
same theorem both the weight and the hopcount in the ERRG between the vertices i and U 
are equal in distribution to G^ and A^ , where Gjj denotes the generation of mark U and Ajj 
denotes the splitting-time of U. Hence, in order to prove Theorem 4.4, we need an upper bound 
on the expected number of marks in SWTj, whose minimal weight path contains thinned marks. 
This will be the content of the first lemma below. The second lemma below shows that Cn/cin 
converges to an Exp(l) random variable, so that we have a handle on the size of C„. 

Lemma 4.5 (A coupling bound) Fix k > 1 and denote by Mj^ , i = 1,2, the number of marks 
in SWTJ^'' for which the minimal path from root i to that mark contains a thinned mark (or 
individual). Then 

E[Mf]<^^, i = l,2. (4.17) 

n cifi 

The above inequality holds in SWT^^' for 1 < A; < a„ and in SWT^^' for 1 < k < C^. 

Proof. We start with SWT^^ . Let SWT^ contain Mj^ marks whose minimal path contains 
at least one thinned mark. The new mark which is drawn to be the mark of the newly wetted 
vertex in SWT^', j^ attaches to one of these Mf} marks with probability Mj} /{k + 1), since 
ISWTJj' I = A: + 1. If the new mark is attached to a vertex with one of the A; + 1 — Mj}'' other 
marks, then with probability at most [k + l)/n this mark appeared previously and has to be 
thinned. Hence 

E[M«, - MflM^^)] < ^ + ^. (4.18) 

For SWT^^' the fraction ^^^ in the recursion should be replaced by ^^^ , because for each 
1 < fc < Cn we know that SWT^^' contains at most On different marks. Write b^ = E[M^^']. 
Taking double expectations in (4.18) and solving the recursive inequality yields: 

< 1- 1-, so that < — . 

k + 1 n k k + 1 n 

This yields E[A/^ ] < ^. The proof for SWT^^ is similar, however since at most an marks are 
occupied by SWT^^^, we replace n by n — a^ and this yields (4.17). ■ 

Lemma 4.6 (Weak convergence of connection time) Conditioned on the event that both 
CTMBPs survive, the connection time C„, defined in (4.14), satisfies the asymptotics 

Cn/an A E, (4.19) 

where E has an exponential distribution with rate 1. 
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Proof. Define by Qn the conditional probability given both SWT^^^ and SWT-^ . Similar to [3, 
Lemma B.l], the probability P(C„, > m) satisfies the following product form: 

m 

P(C„ > m) =E[llqli\Cn >j\Cn>J- 1)] . (4.20) 

i=i 

We omit the proof of (4.20), which follows by suitable conditioning arguments and is identical to 
[3, Proof of Lemma B.l]. Since both branching processes survive, SWT^^' contains o„(l + o(l)) 
different marks and by (4.17) the expected number of multiple marks is 0(1). From this we 
obtain that as long as the second branching process does not die out, 

nUXr. . .ir. . . 1^_. a„(l + o(l)) 



\Cn>j\Cn>j-l) = l 

n 
Substitution of this into (4.20) with m = a„x, and x > 0, yields 

anX 



P(C„ > a„x) =\l- ""^^ °^^^^ " ^ e-^ (4.21) 

L n J 

and we arrive at (4.19). ■ 

Proof of Theorem 4.4. We start with the proof of the hopcount. Since 1 and 2 are connected, 
with high probability, 1 and 2 are contained in the giant component of the ERRG. This implies 
that both CTMBP's do not die out. Consider the mark set SWT^^' and choose one of the wet 

tin 

individuals at random. The mark number of this individual will be denoted by Un- Then the 
probability that the shortest weight path from root 1 to Un contains a thinned mark is at most 

_L^2lI < > 0. (4.22) 

Qn ^ ^n 

This shows that whp the hopcount between vertex 1 and vertex f/„, in G„(p„) is given by G^ff ^ . 
By (4.19), Cn = Op{^/nlogn), and a similar argument shows that, whp, the hopcount between 2 
and the individual corresponding to the last mark added to SWT^^' is equal to G^^ . 

Now recall the discussion around (4.5) and (4.6). We conclude from this discussion that indeed 
at time C„ one of the marks in SWT^^' is chosen uniformly at random and this establishes (4.15). 
Finally, we introduce the notation p(i) for i G SWT^., to denote the parent of mark i, i.e., the 
mark to which i was attached in SWT^. Because of the way the exponential weight in (4.6) can 
be re-allocated to edges in SWT^^^, the edge {p(i),i} gets weight ^a„+i — ^t(i) and the weight 
E'^j is allocated to ij, with i G SWTW and j ^ SWT^'^, so that we get (4.16). ■ 



5 Proof of the main results 

In this section, we complete the proof of our main results. 

Proof of Theorem 2.1. Observe that the statistical dependence between the two CTMBPs, 

i.e., the one that grows from the root with mark 1 and the one that grows from the root with 
mark 2, is introduced only through their marks. Moreover, in each tree, the hopcount of the m 
added individual is independent of the marks. Hence, conditioned on the event {Cn = rn}, the 
random variables G'ff ^ and G[^' = Gm are statistically independent. 
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Denote by Z4'' = [G'^jla^ - /31ogo„)/V^bg^ and by Z'^^ = (Gg^ - /31ogC„)/V/:^log C„, and 
by ^1, ^2 two independent standard normal random variables. Then, by Proposition 4.3, for any 
sequence m„ = o(n) — )• oo, and for any bounded continuous function g of two real variables 

lim E[5(ZW,Z(^))|C„ = 7n„] =E[g(6,e2)]. (5.1) 

n— >-oo 

Hence by bounded convergence (the function g is bounded), and using (4.19), 

lim E[5(ZW,Z(^')] = / lim E 5(^^\^f )|C„ = \any^ ^(C„ < Kyi) 

oo 

]Eb(ei,e2)]e-^dy = Eb(ei,e2)]. 



This shows the joint weak convergence of the pair {Zn\Zn) to (^i,'?2)- By the continuous 
mapping theorem [4, Theorem 5.1] applied to (x,y) i— t- x + y, we then find that 

gn-/31ogn ^ g[)!,a„-(/3/2)log^ ^ Gg;^-(/3/2)logn 
Vlogn -v/Iogn i/log^ 

g'[)!.a„- /Slogan Vb^^ Gg^-/31ogC„ Vb^a , ,^, 

= 7^ 7^= + /^ — n- 7^^+Op(l) 

Vlogfln Vlogn VlogCn Vlogn 

^ 6/^^ + 6/^^, (5.2) 



and we note that (^i + ,^2)/v 2 is again standard normal. ■ 

Proof of Theorem 2.2. Similarly to the proof of Theorem 2.1, for any sequence m„ — )■ oo, 

A^^l - 7 log an, All - ^ log C„|C„ = ?n„) A (Xi, X2), (5.3) 

with Xi,X2 two independent copies of X = —^\og{'^W\). We rewrite the second coordinate as: 

A^^^ - 7 log C„ = A[^^ - 7 log a„ - 7 log(C„/a„). 
Hence, from (5.3) and the limit law for C^/an given in (4.19), we obtain: 
W;,-7logn = A^^l+Al\-2-i log an 



Xi+X2+-f\og{E), 



= A^^l - 7 log an + Ag^ - 7 log C„, + 7 log(C„,/a„) 

d 



where E = Exp(l) is independent of (Xi,X2). We can reformulate this as 

(A - l)Wn - logn ^ -log(7VF{'^) - log(7Wf - M, (5.4) 

where M = —\og{E) has a Gumbel distribution, i.e., P(A/ < x) = A(x) = exp(— e~^). ■ 

Proof of Corollary 2.4. We start with a proof of (3.10), with j3 replaced by /?„ = A„/(An — 1). 
We give the proof for the CTMBP with Bin(n, A„/n) offspring distribution (thus avoiding the 
coupling with Poisson random variables in Proposition 4.3). Since, for A„ — ?• 00, any two vertices 
are, whp, connected it is not necessary to condition on the event {Si > 0, 1 < i < m}, and 
therefore it is straightforward from (3.4) and (3.5) that 

m 

Gm. = Y,Ii, where P(/, = l|{Z?,'"'}i<,<„) = Z^f V^''^', (5.5) 



i=l 



18 



with D^;'\D^^\...,D'i^\ independent Bin(n,A„/n) and 5,?"' = E}=i ^j"' - (^ - !)• By [14, 
Chapter 3], the random variables D^ concentrate around An, whereas S" concentrates around 
i{\n — 1) + 1. Therefore Gm in (5-5) satisfies the same asympotics as 



m 



with /j"', I2"', . . . , Im independent. Applying Lemma 3.4, we conclude that for m = rrin —5- 00, 

Gm-/3„X:^=llA 



VEr:i(/5nA)(i-/3n/i) 



Z, (5.6) 



where Z is standard normal. Note that the denominator of (5.6) can be replaced by ^logm, 
since (3n ^ ^ and hence J2'iLi{/^n/i){^ — Pn/i)/^ogm — )• 1. This proves (3.10), with (3 replaced 
by Pn = )^n/{)^n — 1)- The centering constant l^nYlTLi 1/* '^an be replaced by fin^ogm, but in 
general not by logm, except when {fin — l)y/\ogrn^ — )• 0, or equivalently A^ = o{y/\ogrn^). 



We now turn to the proof of Part (b) of Corollary 2.4. Again, we will use that S*]" concentrates 
around i{\n — 1) + 1. Hence, from (3.6), 

m ^ m 

i=l " i=l 

Now define Bn as a random variable with distribution equal to the maximum of n independent 
exponentially distributed random variables with rate 1, i.e., 

HBn < x) = (1 - e"^)". 



Then Ei/i, 1 < i < n, are equal in distribution to the spacings of these exponentially distributed 

I^onse 

lim F{j~^Ai'l - logo„ <x)= lim P(5„„ - loga„ < x) 



variables i.e., Bn = X]"=i Ei/i. Consequently, with 7„ = 1/(A„ — 1 



= lim (1 - e-^-+i°s«")"" = A(x), (5.7) 

n— >oo 

where A denotes the distribution function of a Gumbel random variable, i.e., A(x) = exp(— e~^). 
Parallel to the proof of Theorem 2.2, we conclude that 

(7-^<: - loga„,7-^4'i - logCn) A (Mi,M2), 

where Mi and M2 are two independent copies of a random variable with distribution function A. 
Finally, since Cn/cLn converges in distribution to an Exp(l) random variable, we conclude from 
the continuous mapping theorem that (2.5) holds. ■ 

Proof of Theorem 2.6. This proof is a consequence of Theorems 2.1-2.2 and the results on the 
diameter of G„(A/n) proved in [8]. We shall sketch the proof. We start by recalling some notation 
and properties of the ERRG from [8]. Let the 2-core of Gn{^/n) be the maximal subgraph of 
which each vertex has degree at least 2. The study of the diameter in [8] is based on the crucial 
observation that any longest shortest path in a random graph (with minimum degree 1) will occur 
between a pair vertices u^v of degree 1. Moreover, this path will consist of three segments: a 
path from u to the 2-core, a path through the 2-core, a path from the 2-core to v. While this is 
used in [8] only for graph distances, the same applies to FPP on a graph. Now, when we pick two 
uniform vertices i and j (as in Theorems 2.1-2.2), then the paths from i and j to the 2-core are 
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(a) unique and (b) whp of length o(log n) (since the giant component with the 2-core removed 
is a cohection of trees of which most trees have size o(logn)). As a result, Theorems 2.1-2.2 also 
hold when picking two uniform vertices in the 2-core. 

Next we investigate the maximal weight and length of shortest-weight paths in the ERRG. In [8, 
Section 6.1], it is shown that whp there are two paths of length at least (1/log (— ^a) — £) logn in 
the giant component with the 2-core removed connecting two vertices i* and j* of degree 1 to two 
vertices U, V in the 2-core. Since these paths are unique, they must also be the shortest-weight 
paths for FPP on the ERRG between the vertices i* and U, and j* and V, respectively. The 
weights along these paths is thus a sum of (1/ log (— ^a) — s) log n i.i.d. Exp(l) random variables. 
By exchangeability of the vertices, U, V are uniform vertices in the core. Thus, the shortest- 
weight path from i* to j* has at least A/(A — 1) log n + Op(log(n)) -|- (2/ log (— /Ua) — 2e) log n hops, 
as required, while its weight is at least 1/(A — 1) log n + Op(log(?i)) -1- (2/ log (— /-^a) — 2e) log n. m 
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